Parity Striping of Disk Arrays: Low-Cost Reliable Storage with Acceptable Throughput

نویسندگان

  • Jim Gray
  • Bob Horst
  • Mark Walker
چکیده

An analysis of mirrored discs and of RAID5 shows that mirrors have considerably better throughput, measured as requests/second on random requests of arbitrary size (up to 1MB). Mirrors have comparable or better response time for requests of reasonable size (less than 100KB). But mirrors have a 100% storage penalty: storing the data twice. Parity striping is a data layout that stripes the parity across the discs, but does not stripe the data. Parity striping has throughput almost as good as mirrors, and has cost/GB comparable to RAID5 designs -combing the advantages of both for high-traffic disc resident data. Parity striping has additional fault containment and software benefits as well. Parity striping sacrifices the high data transfer rates of RAID designs for high throughput. It is argued that response time and throughput are preferable performance metrics. Outline Introduction Why Striping and RAID Are Inappropriate for OLTP Systems Parity Striping: Cheap Reliable Storage Plus High Throughput An Analysis of Mirrors, RAID5, and Parity Stripe Performance Mirrored Discs Parity Stripe Discs RAID5 Discs Applying the Analysis to a 10+2 array The Read-Only Case The Write-Only Case Analyzing a High-Performance Disc Other Benefits of Parity Striping Summary Acknowledgments References Appendix: Spreadsheets Corresponding to the Graphs

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Terabyte EIDE Disk Arrays running Linux RAID5

High-energy physics experiments are currently recording large amounts of data and in a few years will be recording prodigious quantities of data. New methods must be developed to handle this data and make analysis at universities possible. Grid Computing is one method; however, the data must be cached at the various Grid nodes. We examine some storage techniques that exploit recent developments...

متن کامل

1 9 N ov 2 00 4 Multi - Terabyte EIDE Disk Arrays running Linux RAID 5

High-energy physics experiments are currently recording large amounts of data and in a few years will be recording prodigious quantities of data. New methods must be developed to handle this data and make analysis at universities possible. Grid Computing is one method; however, the data must be cached at the various Grid nodes. We examine some storage techniques that exploit recent developments...

متن کامل

Swift/RAID: A Distributed RAID System

The Swift VO architecture is designed to provide high data rates in support of multimedia type applications in general-purpose distributed environments through the use of distributed striping. Strþing techniques place sections of a single logical data space onto multiple physical devices. The original Swift prototype was designed to validate the architecture, but did not provide fault tolerance...

متن کامل

Self-Repairing Disk Arrays

As the prices of magnetic storage continue to decrease, the cost of replacing failed disks becomes increasingly dominated by the cost of the service call itself. We propose to eliminate these calls by building disk arrays that contain enough spare disks to operate without any human intervention during their whole lifetime. To evaluate the feasibility of this approach, we have simulated the beha...

متن کامل

The Zebra Striped

Zebra is a network file system that increases throughput by striping the file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an approach similar to a log-structured file system. Thm provides high performance for writes of small files as well as for reads and writes of large ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1990